Kaixin Li

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

Jan 29, 2026

CrownGen: Patient-customized Crown Generation via Point Diffusion Model

Dec 26, 2025

MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique

Nov 12, 2025

Grounding Computer Use Agents on Human Demonstrations

Nov 10, 2025

MemeArena: Automating Context-Aware Unbiased Evaluation of Harmfulness Understanding for Multimodal Large Language Models

Oct 31, 2025

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Jul 02, 2025

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning

May 18, 2025

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning

May 18, 2025

MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation

Feb 18, 2025

Robi Butler: Remote Multimodal Interactions with Household Robot Assistant

Sep 30, 2024